SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Wang, Elizabeth A. 

Rosen, Vicki A. 
Wozney, John M. 

(ii) TITLE OF INVENTION: Novel BMP Products 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LEGAL AFFAIRS, GENETICS INSTITUTE, INC. 

(B) STREET: 87 CAMBRIDGEPARK DRIVE 

(C) CITY: CAMBRIDGE 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02140 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

~; (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

:*f (D) SOFTWARE: Patentin Release #1.0, Version #1.25 

C (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

Ji; (C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: KAPINOS, ELLEN J. 

(B) REGISTRATION NUMBER: 32,245 

Q (C) REFERENCE/DOCKET NUMBER: GI5160C 

(ix) TELECOMMUNICATION INFORMATION: 
rf (A) TELEPHONE: 617-876-1170 

(B) TELEFAX: 617-876-5851 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: BOS TAURUS 

(vii) IMMEDIATE SOURCE: 



(A) LIBRARY :^PVINB GENOMIC IN LAMBDA J 

(B) CLONE: LAMBDA BP-21 



(viii) POSITION IN GENOME: 
(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..390 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 46.. 387 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

3GC CAC GAT GGG AAA GGA CAC CCT CTC CAC AGA AGA GAA AAG CGG CAA 48 
31y His Asp Gly Lys Gly His Pro Leu His Arg Arg Glu Lys Arg Gin 
-15 -10 -5 1 

3CA AAA CAC AAA CAG CGG AAA CGC CTC AAG TCC AGC TGT AAG AGA CAC 96 
\la Lys His Lys Gin Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His 

CI 5 10 15 

CCThiTTA TAT GTG GAC TTC AGT GAT GTG GGG TGG AAT GAC TGG ATC GTT 144 
Pra=:Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp lie Val 
I 20 25 30 

SCJ^ CCG CCG GGG TAT CAT GCC TTT TAC TGC CAT GGG GAG TGC CCT TTT 192 
Ma:;!Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe 
35 40 45 

2CC = iCTG GCC GAT CAC CTT AAC TCC ACG AAT CAT GCC ATT CTC CAA ACT 240 
ProjjJLeu Ala Asp His Leu Asn Ser Thr Asn His Ala lie Leu Gin Thr 

55 60 65 

CTG::GTC AAC TCA GTT AAC TCT AAG ATT CCC AAG*GCA TGC TGT GTC CCA 288 
Leut^Val Asn Ser Val Asn Ser Lys lie Pro Lys Ala Cys Cys Val Pro 

70 75 80 

\CA GAG CTC AGC GCC ATC TCC ATG CTG TAC CTT GAT GAG AAT GAG AAG 336 
rhr Glu Leu Ser Ala lie Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys 
85 90 95 

3TG GTA TTA AAG AAC TAT CAG GAC ATG GTT GTC GAG GGT TGT GGG TGT 384 
/al Val Leu Lys Asn Tyr Gin Asp Met Val Val Glu Gly Cys Gly Cys 
100 105 110 

3GT TAGCACAGCA AAATAAAATA TAAATATATA TATATATATA TTAGAAAAAC 437 
\rg 

115 

AGCAAAAAAA TCAAGTTGAC ACTTTAATAT TTCCCAATGA AGACTTTATT TATGGAATGG 497 

W^TGGAGAAA AAGAAAAACA CAGCTATTTT GAAAACTATA TTTATATCTA CCGAAAAGAA 557 

3TTGGGAAAA CAAATATTTT AATCAGAGAA TTATT 59 2 



(2) INFORMATION FOR 



4i^ID no: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ'ID NO: 2: 

Gly His Asp Gly Lys Gly His Pro Leu His Arg Arg Glu Lys Arg Gin 
-15 -10 ^ ^ 1 

Ala Lys His Lys Gin Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His 
5 10 15 

Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp He Val 
20 25 30 

Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe 
35 40 45 

Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala 11^ -teu Gin Thr 
1? 5^ 60 65 

iMa Val Asn Ser Val Asn Ser Lys He Pro Lys Ala Cys Cys Val Pro 
5; '^^ 75 80 

TfijT Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys 
85 90 95 

Val Val Leu Lys Asn Tyr Gin Asp Met Val Val Glu Gly Cys Gly Cys 
O 100 105 110 



(1) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(G) CELL TYPE: Osteosarcoma Cell Line 

(H) CELL LINE: U-20S 



(vii) 



IMMEDIATE SOURCE: 
(A) LIBRARY: U20S cDNA in Lambda GTIO 



(B) CLONE: Iipbda U^OS-39 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 356.. 1546 

(ix) FEATURE: 

(A) NAME/KEY: mat_j>eptide 

(B) LOCATION: 1202.. 1543 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 14.. 1607 

(ix) FEATURE: 

(A) NAME/KEY: sig_j>eptide 

(B) LOCATION: 356.. 424 



K'i (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

fflCGACTCTA GAGTGTGTGT CAGCACTTGG CTGGGGACTT CTTGAACTTG CAGGGAGAAT 60 

AjjqTTGCGCA CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT 120 

C<i^AGCCCGA CCGCCCCTCC ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC 180 

G-nSAAAAGAG AGACTGCGCG GCCGGCACCC GGGAGAAGGA GGAGGCAAAG AAAAGGAACG 240 

GJ^jATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT TCCATGTGGA CGCTCTTTCA 300 

AO^GACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT CGACC ATG 358 

i; . Met 

-282 

GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC CTC 406 
Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val Leu 
-280 -275 -270 

CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC 454 
Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe 
-265 -260 -255 -250 

GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC 502 
Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu Val 

-245 -240 -235 

CTG AGC GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG 550 
Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys Gin 
-230 -225 -220 

AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC 598 
Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu Asp 
-215 -210 -205 

CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC CAC 646 



i^jky v»in Pro Gly Ser Pro Alflli 



Leu Tyr Arg Arg His Se JIPLy v»in Pro Gly Ser Pro AlVPro -*sp His 
-200 -195 -190 

ZGG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC CAC 694 

keg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe His 
-185 -180 -175 -170 

:AT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG AGT GGG AAA ACA ACC 742 
lis Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr Thr 

-165 -«160 -155 

:GG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT ATC 790 
\rg Arg Phe Phe Phe Asn Leu Ser Ser lie Pro Thr Glu Glu Phe lie 
-150 -145 -140 

\CC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT TTA 838 
rhr Ser Ala Glu Leu Gin Val Phe Arg Glu Gin Met Gin Asp Ala Leu 
-135 -130 -125 

3GA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA 886 
lly Asn Asn Ser Ser Phe His His Arg lie Asn lie Tyr Glu lie lie 
-120 -115 -110 

\M, CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC 934 
Lys; Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp 
-IBS -100 -95 -90 

ACC; AGG TTG GTG AAT CAG AAT GCA AGC AGG TGG GAA AGT TTT GAT GTC 982 
rhr Arg Leu Val Asn Gin Asn Ala Ser Arg Trp Glu Ser Phe Asp Val 

-85 -80 -75 

ACW CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA 1030 
rhfc Pro Ala val Met Arg Trp Thr Ala Gin Gly His Ala Asn His Gly 
-70 -65 -60 

Vrk GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC AAG 1078 
PhtP Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser Lys 
p -55 -50 * -45 

AGA CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC TGG 1126 
Arg His Val Arg lie Ser Arg Ser Leu His Gin Asp Glu His Ser Trp 
-40 -35 -30 

rCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA GGG 1174 
Ser Gin lie Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys Gly 
-25 -20 -15 -10 

CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC AAA CAG CGG 1222 
His Pro Leu His Lys Arg Glu Lys Arg Gin Ala Lys His Lys Gin Arg 

-5 15 

AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC 1270 
Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe 
10 15 20 

AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC 1318 
Ser Asp Val Gly Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr His 
25 30 35 



see TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG 



1366 



ila Phe Tyr Cys His Gl^Bu cys Pro Phe Pro Leu All^Fsp ULs Leu 

40 45 50 55 



\AC TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC 1414 

isn Ser Thr Asn His Ala He Val Gin Thr Leu Val Asn Ser Val Asn 

60 65 70 

?CT AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC 1462 
>er Lys He Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala He 
75 80 ' 85 

:CG ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT 1510 
Jer Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr 
90 95 100 

UiG GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 1563 
;in Asp Met Val Val Glu Gly Cys Gly Cys Arg 

105 110 115 

lATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAA 1607 
2) INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 
-,: (D) TOPOLOGY: linear 

J' (ii) MOLECULE TYPE: protein 

1=1 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

letiiyal Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 
•282;; -280 -275 -270 

jeulLeu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys 
£-265 -260 ' -255 

he Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu 
250 -245 -240 -235 

lal Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys 

-230 -225 -220 

Jin Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu 
-215 -210 -205 

isp Leu Tyr Arg Arg His Ser Gly Gin Pro Gly Ser Pro Ala Pro Asp 
-200 -195 -190 

lis Arg Leu Glu Arg Ala^Alaf Ser Arg Ala Asn Thr Val Arg Ser Phe 
-185 ^-*180 -175 

/"^ 

lis His Glu Glri^, Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr 
-170 -165 -160 -155 

Chr Arg Arg Phe Phe Phe Asn Leu Ser Ser He Pro Thr Glu Glu Phe 

-150 -145 -140 



s^^l^ln «al Phe Arg Glu Gin mJH^] 



He Thr Ser Ala Glu Le^jpin «al Phe Arg Glu Gin Ml^&ln ^sp Ala 
-135 -130 -125 

Leu 61y Asn Asn Ser Ser Phe His His Arg lie Asn He Tyr Glu He 
-120 -115 -110 

He Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu 
-105 -100 -95 

Asp Thr Arg Leu Val Asn Gin Asn Ala Ser Arg Trp Glu Ser Phe Asp 
-90 -85 -80 -75 

Val Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His 

-70 -65 -60 

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser 
-55 -50 -45 

Lys Arg His Val Arg He Ser Arg Ser Leu His Gin Asp Glu His Ser 
-40 -35 -30 

Trp Ser Gin He Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys 
-25 -20 -15 

GlylHis Pro Leu His Lys Arg Glu Lys Arg Gin Ala Lys His Lys Gin 
-10^ -5 15 

ArgrfLys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp 
± 10 15 20 

PhditJSer Asp Val Gly Trp Asn Asp Trp He Val Ala Pro Pro Gly Tyr 

fi 25 30 35 

HiCAla Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His 

hi 40 45 50 

LeuF;Asn Ser Thr Asn His Ala He Val Gin Thr^Leu Val Asn Ser Val 
5^5 60 65* 70 

Asn Ser Lys He Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala 

75 80 85 

He Ser Met Leu Tyr Leu Asp Glu ksn Glu Lys Val Val Leu Lys Asn 
90 95 100 

Tyr Gin Asp Met Val Val Glu Gly Cys Gly Cys Arg 
105 110 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hoao Sapiens 

(G) CELL TYPE: Osteosarcoma Cell Line 

(H) CELL LINE: U-20S 

(Vii) IMMEDIATE SOURCE: 

(A) LIBRARY: U20S cDNA in Bambda gtlO 

(B) CLONE: Lambda U20S-3 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 403.. 1629 

(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 1279.. 1626 

: 1 (ix) FEATURE: 

(A) NAME/KEY: mRNA 
n] (B) LOCATION: 9.. 1934 

j (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CT^AGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA 60 

3GTGAGTGTG GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG 120 

AGTATCTAGC TTGTCTCCCC GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC 180 

ACAfTCCCCG GCCCTCGCCC AGGTTCACTG CAACCGTTCA GAGGTCCCCA GGAGCTGCTG 240 

CTGOCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC GTAGTGCCAT CCCGAGCAAC 300 

3CACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG CTGTCAAGAA 360 

TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT GGT 414 

Met lie Pro Gly 
-292 -290 

AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC 462 
Asn Arg Met Leu Met Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly 
-285 -280 -275 

GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 510 
Ala Ser His Ala Ser Leu lie Pro Glu Thr Gly Lys Lys Lys Val Ala 
-270 -265 -260 

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 558 
Glu He Gin Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu 
-255 -250 -245 

CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 606 
Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gin Met Phe Gly Leu Arg 



-240 



-230 -225 



CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CC6 GAC TAC ATG CGG 654 
hrg Arg Pro Gin Pro Ser Lys Ser Ala Val lie Pro Asp Tyr Met Arg 

-220 -215 -210 

SAT CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 702 
ksp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu Glu Gin He His 
-205 -200 -195 

^GC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 750 
3er Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr 
-190 -185 -180 

3TG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 798 
7al Arg Ser Fhe His His Glu Glu His Leu Glu Asn He Pro Gly Thr 
-175 -170 -165 

\GT GAA AAC TCT 6CT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 846 
Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser He Pro 
-160 -155 -150 -145 

SAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 894 
31u.Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gin 
r: -140 -135 -130 

3T<|;jGAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 942 
7al=Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His Arg He Asn He 
-125 -120 -115 

rAflJGAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 990 
ryrilGlu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu He 
-110 -105 -100 

ACA?CGA CTA CTG GAC ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG 1038 
rhriArg Leu Leu Asp Thr Arg Leu Val His His Asn Val Thr Arg Trp 
S-95 -90 -85 

SAAlACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 1086 
SIC Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp Thr Arg Glu Lys 
-80 -75 -70 -65 

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 1134 
Sin Pro Asn Tyr Gly Leu Ala He Glu Val Thr His Leu His Gin Thr 

-60 -55 -50 

CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 1182 
krg Thr His Gin Gly Gin His Val Arg He Ser Arg Ser Leu Pro Gin 
-45 -40 -35 

SGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC ACC TTT GGC 123 0 

Sly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu Val Thr Phe Gly 
-30 -25 -20 

CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 1278 
His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg 
-15 -10 -5 

\GC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 1326 
3er Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys 



10 W 15 

CGG CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC 
Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp 
20 25 30 

TGG ATT GTG GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC 
Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly Asp 
35 40 45 



TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala He 
50 55 60 

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 
Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He Pro Lys Ala Cys 
65 70 75 80 

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 
Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp Glu 

85 90 95 

TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 
Tyx Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu Gly 
100 105 110 

T# GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 
Cys Gly Cys Arg 

iS 115 

C^jsfACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 
A^GACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 
A^CCCTAAAC ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 
T|ATCATATA TTTTGACAAA ATATATTTAT AACTACGTAT^ TAAAAGAAAA AAATAAAATG 
AGTCATTATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

^■'•y Met Val Val Leu Leu Cys Gin Val 

-292 -290 _285 -280 

Leu Leu Gly Gly Ala Ser His Ala Ser Leu He Pro Glu Thr Gly Lys 
"275 -270 -265 

Lys Lys Val Ala Glu He Gin Gly His Ala Gly Gly Arg Arg Ser Gly 
"260 -255 -250 -24 



Gin Ser His Glu Leu Leu^urg Asp Phe Glu Ala Thr LeuTLeu Gin Met 

-240 -235 -230 

Phe Gly Leu Arg Arg Arg Pro Gin Pro Ser Lys Ser Ala Val lie Pro 
-225 -220 -215 

Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu 
-210 -205 -200 

Glu Gin lie His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser 
-195 -190 -185 

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn 
-180 -175 -170 -165 

lie Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu 

-160 -155 -150 

Ser Ser lie Pro Glu Asn Glu Val lie Ser Ser Ala Glu Leu Arg Leu 
-145 -140 -135 

Phe Arg Glu Gin Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His 
-130 -125 -120 

Argille Asn lie Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro 
::lrll5 -110 -105 

GlyliSis Leu lie Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn 
-10|= -95 -90 -85 

ValJThr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp 

-80 -75 -70 



ThCjArg Glu Lys Gin Pro Asn Tyr Gly Leu Ala lie Glu Val Thr His 
-65 -60 -55 

Leu=kis Gin Thr Arg Thr His Gin Gly Gin His Val Arg He Ser Arg 
-50 -45 -40 

ser Leu Pro Gin Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu 
-35 -30 -25 

Val Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg 
-20 -15 -10 -5 

Arg Ala Lys Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys 

15 10 

Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val 
15 20 25 

Sly Trp Asn Asp Trp He Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr 
30 35 40 

Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr 
45 50 55 60 

^sn His Ala He Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He 

65 70 75 



>ro Lys Ala Cys Cys VarTro Thr Glu Leu Ser Ala He Ser Met Leu 
80 85 90 

Vr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met 
95 100 105 

'al Val Glu Gly Cys Gly Cys Arg 
110 115 



I 



